Overview of the Netarkivet web archiving system
Abstract
The Netarkivet web archiving system was created to fulfill our obligation as national archives to collect and preserve Danish internet material. It uses the Internet Archive’s Heritrix crawler to harvest data, surrounded by an in-house system that automates harvests and archives the results. This article presents a rough sketch of the overall system, with more detail on the module that defines and runs harvests.
Similar resources
Preserving the bits of the Danish Internet
One of the many challenges in large-scale web archiving is long-time preservation of the amounts of data generated by the harvester. We describe the bit preservation solution chosen by Netarkivet. We then define a programmatic, probabilistic model of hardware failures and repair operations in the solution. The mean time to failure of this model is then computed in a number of experiments based ...
BlogForever: From Web Archiving to Blog Archiving
In this paper, we introduce blog archiving as a special type of web archiving and present the findings and developments of the BlogForever project. Apart from an overview of other related projects and initiatives that constitute and extend the capabilities of web archiving, we focus on empirical work of the project, a presentation of the BlogForever data model, and the architecture of the BlogF...
Study of the Attitude of Users towards Picture Archiving and Communication System Based on the Technology Acceptance Model in Teaching Hospitals of Qom, Iran
Background and Objectives: Many healthcare providers use health information technology to improve their performance. Picture Archiving and Communication System is a subsystem of the health information system that aims to facilitate the storing, archiving, and managing of digital images as well as their transmission. In this regard, measuring the level of acceptance of technology can be very hel...
Users’ satisfaction with imaging services before and after the implementation of picture archiving and communication system
Introduction: The picture archiving and communication system is a digital device designed for processing, archiving and communicating medical images with different parts of hospitals, physicians and radiologists. Therefore, the current study aimed to determine the impact of the system on users’ satisfaction with imaging services before and after its implementation. Methods: This cross-secti...
The Web-at-Risk at Three: Overview of an NDIIPP Web Archiving Initiative
The Web-at-Risk project is a multi-year National Digital Information Infrastructure and Preservation Program (NDIIPP) funded effort to enable librarians and archivists to capture, curate, and preserve political and government information on the Web, and to make the resulting Web archives available to researchers. The Web-at-Risk project is a collaborative effort between the California Digital L...
Publication date: 2006